Assessing Machine Translation Quality with Error Analysis

نویسنده

  • Maarit Koponen
چکیده

Translation quality can be evaluated with regard to different aspects, such as accuracy (fidelity), fluency and fitness for purpose. In using a machine translation system for information purposes, accuracy of semantic content is the key aspect of quality. Automated quality metrics developed in the machine translation field have been criticized for conflating fluency of form with accuracy of content and for failing to provide any information on the types of errors in the translations. Our research aims to discover criteria for assessing translation quality specifically in terms of accuracy of semantic content in translation. This paper demonstrates how an error analysis with a view to identifying different error types in machine translations can serve as a starting point for such criteria. The error classification described focuses on mismatches of semantic components (individual concepts and relations between them) in the source and target texts. We present error analysis results, which show differing patterns both between human translators and machine translation systems on the one hand and two different kinds of translation systems on the other.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Assessing the Impact of Translation Errors on Machine Translation Quality with Mixed-effects Models

Learning from errors is a crucial aspect of improving expertise. Based on this notion, we discuss a robust statistical framework for analysing the impact of different error types on machine translation (MT) output quality. Our approach is based on linear mixed-effects models, which allow the analysis of error-annotated MT output taking into account the variability inherent to the specific exper...

متن کامل

Assessing the Impact of Speech Recognition Errors on Machine Translation Quality

In spoken language translation, it is crucial that an automatic speech recognition (ASR) system produces outputs that can be adequately translated by a statistical machine translation (SMT) system. While word error rate (WER) is the standard metric of ASR quality, the assumption that each ASR error type is weighted equally is violated in a SMT system that relies on structured input. In this pap...

متن کامل

Quality Assessment of the Persian Translation of John Steinbeck’s Of Mice and Men Based on Waddington’s Model of Translation: Application of Method A

Considering the statement that errors can affect the quality of translations, the need to adopt an objective model to analyze these errors has been one of the most debated issues in translation quality assessment. In recent decades, some objective models have emerged with an error analysis nature according to which evaluators can make decisions on the quality of translations. In this study, Met...

متن کامل

Ranking Translations using Error Analysis and Quality Estimation

We describe TerrorCat, a submission to this year’s metrics shared task. It is a machine learning-based metric that is trained on manual ranking data from WMT shared tasks 2008–2012. Input features are generated by applying automatic translation error analysis to the translation hypotheses and calculating the error category frequency differences. We additionally experiment with adding quality es...

متن کامل

The Correlation of Machine Translation Evaluation Metrics with Human Judgement on Persian Language

Machine Translation Evaluation Metrics (MTEMs) are the central core of Machine Translation (MT) engines as they are developed based on frequent evaluation. Although MTEMs are widespread today, their validity and quality for many languages is still under question. The aim of this research study was to examine the validity and assess the quality of MTEMs from Lexical Similarity set on machine tra...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010